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Department  of  Statistics 


Co-  Investigator: 


University  of  Kentucky 

V.  P.  Bhapkar 
Department  of  Statistics 
University  of  Kentucky 

Since  the  grant  involves  two  distinct  projects,  this  report  is  divided 

✓ 

into  two  sections,  number  I dealing  with  the  work  of  the  principal  investigator, 
and  number  II  dealing  with  the  co-investigator's  research. 

I.  Screening  and  Estimation  Procedures  for  the  Unknown  Number  of 
Defective  Items  in  a Life  Test,  and  Estimation  of  the  Size  of 
a Finite  Population. 

A.  Research  Accomplished  Under  the  Current  Grant 

1.)  Continuation  of  previous  research. 

V . 

In  addition  to  research  which  was  initiated  under  the  grant,  grant- 

-i  sponsored  research  time  was  used  to  make  extensive  revisions  of  several 

papers  which  had  been  submitted  for  publication.  The  revised  versions  of 

the  papers  acknowledge  the  grant  for  partial  support  of  the  research. 

These  papers  are: 
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a)  "Sequential  Estimation  of  the  Largest  Normal  Mean  When  the 

Variance  is  Known" , to  appear  in  Annals  of  Statistics.  November, 
1976. 


b)  "A  Comparison  of  the  Bad  as  Old  and  Superimposed  Renewal 
Models",  (joint  with  J.  A.  Greenwood  and  L.  H.  Herbach) , to 
appear  in  Management  Science. 

It  should  be  noted  that  this  paper  is  in  the  area  of  relia- 
bility even  though  it  is  not  the  specific  reliability  problem 
discussed  in  the  grant  proposal. 

c)  Robustness  of  Stein’s  Two-Stage  Procedure  for  Mixtures  of  Normal 
Populations,  (joint  with  Z.  Govindarajulu) , to  appear  in  Journal 
of  the  American  Statistical  Association. 


d)  Estimating  the  Complete  Sample  Size  from  an  Incomplete  Poisson 
Sample,  (joint  with  Ram  C.  Dahiya  and  Alan  J.  Gross) , submitted 
to  Journal  of  the  American  Statistical  Association.  This  work  is 
directly  related  to  the  mission  of  this  grant. 


2.)  New  Research 


Completed.  One  technical  report  was  started  and  completed  under 
sponsorship  of  the  grant.  This  was:  "Estimating  Population  Size 

with  Truncated  Sampling"  Tech.  Report  No.  96,  Department  of  Statis- 
tics, University  of  Kentucky.  Submitted  to  Communi  cat  joins  in  Statis — 
tics.  An  abstract  follows : 
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Let  Xjy  X2»  ...jXjj  be  i.i.d.  r.v. 's  with  density  f(x,9)w.r.t.  a 

^“finite  measure  (J.  Let  R be  a measurable  set  in  the  sample  space 

The  value  of  X is  observable  if  X e (x-R)  and  not  otherwise/  The 

number  J of  observable  X's  is  binomial,  N,  Q,  Q = 1-P(X  e R) . On  the  * 

basis  of  J observations,  it  is  desired  to  estimate  N and  9.  Estimators 

considered  are  conditional  and  unconditional  maximum  likelihood  (c.m.l.e. 

and  u.m.l.e.)  and  modified  maximum  likelihood  using  a prior  weight  function 

to  modify  the  likelihood  before  maximizing.  Asymptotic  expansions  are 
A a _ 

developed  for  the  N's  of  the  form  N = N + a/H  + 0 + bp(l) , where  a and  6 
are  random  variables.  All  estimators  have  the  same  a,  which  has  mean 
0,  variance  a2  (a  function  of  0)  and  is  asymptotically  normal.  Hence, 
all  are  asymptotically  equivalent  by  the  usual  limit  distributional 
theory.  The  0's  differ  and  E0  can  be  considered  an  "asymptotic  bias". 
Formulas  are  developed  to  compare  the  asymptotic  biases  of  the  various 
estimators.  For  a scale  parameter  family  of  absolutely  continuous 
distributions  with  x * (0,®) , and  R - (T,~) , special  formuli  are  developed 
and  a best  estimator  is  found. 

b.)  Ongoing.  Several  projects  are  currently  in  progress  under  grant 
sponsorship. 

1.  The  major  project  now  under  way  is  the  preparation  of  a 

monograph  to  be  entitled  "Statistical  Methods  for  Truncated 
and  Other  Incomplete  Data."  This  is  joint  with  L.  Sana than an. 
This  book  will  cover  the  results  of  the  two  co-authors  on  the 
estimation  of  sample  size  from  truncated  observations (which  is 


In  a previous  paper,  the  author  gave  a sequential  screening 
procedure  for  the  case  in  which  F(t)  was  known  completely, 
and  results  were  stated  in  terms  of  the  exponential  distri- 
bution. The  procedure  could  be  used  if  the  distribution 
contained  an  unknown  scale  parameter,  provided  that  a bound 
was  given  on  the  value  of  the  parameter.  However,  it  would 
be  an  inefficient  procedure  in  that  case  unless  the  parameter 
value  was  actually  near  the  bound.  We  have  been  investigating 

a "staged”  sequential  procedure  which  takes  observations  for 

A 

a fixed  period  of  time  Tq,  then  computes  an  estimate  9 of  the 
unknown  scale  parameter  6 and  thereafter  uses  the  same  sequen- 

A 

tial  stopping  rule  as  in  the  earlier  paper  except  that  9 is 
used  in  place  of  9.  The  goal  is  to  guarantee  with  probability 
P*  that  no  more  than  K (P*  ,K  given)  defectives  remain  after 
sampling  terminates.  Not  only  must  we  choose  the  constants  for 
the  stopping  rule  as  in  the  earlier  work,  but  Tq  must  also  be 
chosen.  Some  analytic  results  have  been  obtained,  but  the  prob- 
lem is  not  readily  amenable  to  analysis.  Currently  Monte  Carlo 
studies  are  underway  to  determine  the  probabilities  of  various 
numbers  K remaining  for  different  choices  of  Tq,  of  the  stopping 
constants,  of  9,  N,  and  different  estimators  of  9.  From  these, 
we  hope  to  be  able  to  recommend  approximately  optimal  rules  to 
assure  the  specified  goals.  We  expect  this  to  be  a major 


advance  in  making  these  screening  procedures  accessible  for  use 
in  realistic  situations.  It  should  be  noted  that  J.  M.  Finkel- 
stein  and  R.  E.  Schafer  of  Hughes  Aircraft  Co.,  Fullerton, 
California,  in  an  unpublished  manuscript  entitled,  "A  Sequential 
Screening  Procedure  for  the  Exponential  Distribution" , have  made 
extensive  Monti  Carlo  studies  of  the  properties  of  an  ad-hoc 
sequential  screening  procedure  designed  to  eliminate  the  effect 
of  an  unknown  scale  parameter.  From  this  study,  it  can  be  pre- 
sumed that  good  procedures  of  this  type  are  of  potential  value  in 
the  aircraft  industry. 

For  fixed  amount  of  sampling  when  F(t)  es  exponential  with  unknown 

failure  rate  X,  the  limiting  variance  of  the  M.L.E.  is  an  unbounded 

function  of  X.  Basically  the  difficulty  lies  not  with  some  defect 

in  the  maximum  likelihood  process  but  with  the  difficulty  of 

estimating  N on  the  basis  of  insufficient  information.  For 

instance,  for  fixed  time  sampling,  the  estimator  behaves  quite 

well  if  T > tQX  (tQ  any  constant).  But  if  X is  unknown,  no 

fixed  T cam  be  assured  to  be  sufficiently  large , and  what  would 

appear  to  be  needed  is  either  a two-sample  or  sequential  rule 

A 

which  generates  an  estimate  X and  assures  that  the  ultimate  test 
A 

duration  exceeds  K X,  for  an  appropriate  K.  Such  a rule  should 
give  good  properties  to  any  estimator,  including  the  maximum 
likelihood  one.  In  particular  such  a rule  should  lead  to  bounds 
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on  the  variance  of  the  estimator.  A two  stage  rule  with  a 

fixed  duration  TQ  in  the  first  stage  and  a second  stage 

A A 

duration  proportional  to  X where  X is  based  on  the  first 
stage  observations  is  now  being  studied  by  a graduate  student 
under  the  supervision  of  the  present  investigator. 


6.  Previous  research  has  concentrated  on  finding  a modified 
maximum  likelihood  estimator  which  minimizes  the  asymptotic 
second  order  bias.  Additional  terms  in  our  series  expansions 
are  being  developed  to  allow  selection  of  an  estimator  which 
minimizes  asymptotic  second  order  mean  squared  error.  Research  is 
being  conducted  by  a graduate  student  under  the  supervision  of 
the  principal  investigator. 


7.  A problem  of  interest  in  reliability  and  related  to  this  grant's 
purposes  is  the  estimation  of  the  parameter  of  the  exponential 
distribution  when  the  parameter  changes  with  time.  In  joint 
research  with  A.  Gross,  this  problem  is  under  exploration. 


Consultants.  During  the  past  year,  the  following  visitors  came  to 
Kentucky  under  Grant  sponsorship  to  discuss  mutual  research  interests. 
All  of  these  consultants  proved  helpful  in  furthering  the  research 
efforts  of  the  investigators.  Two  of  the  visitor  have  become  engaged 
in  joint  research  projects  with  the  principal  investigator  as  a 
direct  result  of  the  invitation  (see  section  A-2-b-l,  2,  3,  and  7). 


I 
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1. )  Professor  Ram  C.  Dahiya,  Department  of  Mathematics  and  Statis- 

tics, University  of  Massachusetts. 

2. )  Professor  Alan  J.  Gross,  Department  of  Biometry,  Medical 

University  of  South  Carolina. 

3. )  Professor  Leus  Holst,  Mathematics  Research  Center,  Madison, 

Wisconsin,  and  Uppsala  University,  Sweden. 

4. )  Professor  Arthur  Roth,  Statistics  Department,  Carnegie-Mellon 

University. 

5. )  Professor  Lalitha  Sanathanan,  Department  of  Quantitative 

Methods,  College  of  Business  Administration,  University  of 
Illinois  at  Chicago  Circle. 
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II.  Multivariate  Nonparametric  Methods 

for  Several  Samples 

Asymptotically  distribution-free  tests  for  comparing  several  samples 
in  the  multivariate  case  were  offered  in  the  statistical  literature  during 
the  last  ten  years  in  view  of  the  need  to  discard  the  stringent  assumption 
of  multivariate  normality.  However,  in  view  of  their  computational 
complexity,  these  theoretical  advances  had  not  yet  filtered  down  to  the 
application  stage. 

During  the  first  phase  of  this  research  suitable  computer  programs 

\ 

were  developed  for  the  computation  of  test-criteria  for  comparing  three 
samples  with  two  or  three  correlated  variables.  Simulations  were  then 
carried  out  using  random  samples  generated  from  three  diverse  types  of 
populations  in  order  to  study  the  performance  of  these  criteria  which. are 
theoretically  valid  only  for  large  samples. 

In  the  second  phase  of  this  research,  asymptotically  distribution- 
free  test-criteria  ha^been  developed  for  the  "profile  analysis"  of  several 
multivariate  samples.  Suppose  that  we  have  independent  random  samples  of 
size  n^  from  the  i-th  p-variate  population  with  c.d.f.  F^,  i=l,2, . . . ,k. 

The  earlier  nonparametric  tests  have  dealt  with  the  overall  null-hypothesis 


i.e.  there  are  no  differences  among  k populations.  However,  if  there  are 
some  differences,  one  is  often  interested  in  finding  out  whether  the 
pattern  of  differences  among  populations  is,  in  some  sense,  uniform  across 
variables.  In  other  words,  one  wants  to  know  whether  there  is  any 
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interaction  between  populations  and  variables.  Under  the  assumption  of 
multivariate  normality  with  common  nonsingular  covariance  matrix,  the 
hypothesis  of  no  interaction  is  then  formulated  as 


V vp)-vi^1)  - 


f ^ i • • • j 2,««.,p; 


,0) 


here  is  the  mean  of  the  j-th  variable  for  the  i-th  population.  If  we 

draw  the  "profile"  of  the  i-th  population  by  plotting  its  means  against 
the  corresponding  variable,  the  hypothesis  ^ can  be  interpreted  as 
parallelism  of  profiles  (See  Figure  1.) 
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It  is  then  seen  that  such  an  hypothesis  of  parallelism  is  meaningful 
(and  of  interest)  only  when  the  variables  are  commensurable,  i.e.  are 
comparable  in  some  strong  sense  (e.g.  repeated  measurements  at  different  time 
points,  or  test  scores  in  different  correlated  categories  on  some  common 
scale) . 

Discarding  the  assumption  of  normality  and  to  a considerable  extent 
that  of  commensurability,  we  have  developed  nonparametric  criteria  for 

such  a profile  analysis.  First,  a suitable  nonparametric  analog  of  H^, 

* 

say  H , has  been  formulated  as  the  hypothesis  of  no  population-variable 

* 

interaction.  The  nonparametric  criteria  are  then  developed  to  test 
by  appropriate  modifications  of  the  criteria  to  test  the  overall  Hq.  The 
mathematical  and  other  details  concerning  their  theoretical  properties 
have  been  developed  and  are  given  in  the  following  technical  report: 

V.P.  Bhapkar  and  K.W.  Patterson,  "On  Some  Nonparametric 
Tests  for  Profile  Analysis  of  Several  Multivariate  Samples," 
University  of  Kentucky,  Department  of  Statistics,  Technical 
Report  No.  100,  March  1976. 

Finally,  simulation  studies  similar  to  these  in  the  first  phase  were 
carried  out  in  order  to  study  the  performance  of  these  new  criteria.  The 
findings  of  both  these  simulation  studies  are  reported  in  the  following: 

V.P.  Bhapkar  and  K.W.  Patterson, "A  Monte  Carlo  Study  of  Some 
Multivariate  Nonparametric  Statistics  for  Profile  Analysis  of 
Several  Samples,"  University  of  Kentucky,  Department  of  Statistics 
Technical  Report  No.  101,  March  1976. 


